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1. Introduction 
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scratch. 


The complex nature of the various methods and the large number of problems 
encountered in determining the equivalent models are not surprising given the fact that 
determining these models for a given complex structure (a large space structure or a wing) 
belongs to a class of problems called inverse problems. These problems are inherently 111- 
posed and it is fruitless to attempt to determine unique continuum models. The present 
work deals with investigating the possibility that a more rational and efficient approach 
of determining the continuum models can be achieved by using artificial neural networks. 

The working mechanism in brains of biological creatures has long been an area of 
intense study. It was found around the first decade of this century that neamna (nerve 
cells) are the structural constituents of the brain. The neurons interact with each other 
rough synapses, and are connected by axons (transmitting lines) and dentrites (receiv- 
ing branches). It is estimated that there are on the order of 10 billion neurons in the 
human cortex, and about 60 trillion synapses (Ref. 1). Although neurons are 5~6 orders 
Of magnitude slower than silicon logic gates, the organisation of them is such that the 
bram has the capability of performing certain tasks (for example, pattern recognition, and 
motor control etc.) much faster than the fastest digital computer nowadays. Besides, the 
energetic efficiency of the brain is about 10 orders of magnitude lower than the best com- 
puter today. So it can be said, in the sense that a computer is an information-processing 
system, the brain is a highly complex, nonlinear, and efficient parallel computer. 

Artificial Neural Networks (ANN), or simply Neural Networks (NN) are compu- 
tational systems inspired by the biological brain in their structure, data processing and 
restoring method, and learning ability. More specifically, a neural network is defined as 
a massively parallel distributed processor that has a natural propensity for storing ex- 
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periential knowledge and making it available for future use by resembling the brain in 

-o aspects: (a) Knowledge is ac q uired by the network through a iearnmg process; (b) 

Inter-neuron connection strengths known as synaptic weights (or simply weights) are used 
to store the knowledge (Ref. 1). 

With a history' traced to the ear,y 1940s, and two periods of major increases in 
research activities in the ear,y 1960 s and after the mid-1980s, ANNs have now evolved to 
a mature branch in the computational science and engineering with a large number 
of publications, a lot of q uite different methods and algorithms and many commercial 
software and some hardware. They have found numerous applications in science and 
engineering, from biological and medical sciences, to information technologies such as 

artificial intelligence, pattern recognition, signal processing and control, and to engineering 
areas as civil and structural engineering. 

port a brief description is given to the most extensively used neural net- 
work in civil and structural engineering, Multi-Layer Feed-Forward NN, and another kind 
of NN, Radial Basis Function NN, which is veiy efficient in some cases. Some concep- 
tions! features of NN are listed. Some examples of the application of neural network are 
given, among which several are published real problems in civil and structural engineer- 
ing. Based on our experience of using the Matlab NN Toolbox, some important and 
very practical issues on the application of NN will be brought out. 

Then a section of the report is dedicated to the development of algorithms car- 
rying out the very useful concept of cross-validation. Through the results for several 

examples obtained from the algorithms, some observations on issues such as over-training 
and network complexity are given. 
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In the last section an example of using NN in continuum models is given. A lattice 
structure with repeating cells is represented by a continuum beam whose properties, as 
functions of the repeating cell particulars, are provided by neural networks. 


2. Examples of NN 


As simplified models of the biological brain, 


ANNs have lots of variations due 


to specific requirements of their tasks 
>ty, type of inter-connection, choice of 
method. 


by adopting different degree of network complex- 
transfer function, and even differences in training 


According to the types of network, there are Single Neuron network (1-input 1- 

oatput, no hidden layer), Single-Layer NN or Percepton (no hidden layer), and Multi- 

Layer NN (1 or more hidden layers). According to the types of inter-connnection, there 

are Feed-Forward network (values can on, be sent from neurons of a layer to the next 

ayer), Feed-Backward network (values can only be sent in the different direction ie 

from the present layer to the previous layer), and Recurrent network (values can be sent 
in both directions). 

follow, ng a brief description is given to two kinds of extensively used neural 
networks and some of the pertinent concepts. 


2.1 Feed-Forward Multi-layer Neural Network 

As shown in Fig. 1(a), in the y-th layer, the i-th neural has inputs from the 
0 - I)-th layer of value **-*(* = 1 „,,,). and has the following output 

4 = m 
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where 


r ? = E - K 

*=1 

“ which is the weight between node k of the (j - l)-th layer and node j of the y-th 
layer, and J? is the bias (also called threshold). The above relation can also be written as 

^ = E<v- 


fc =0 


where 4 *=-1 and 4, _1 = V. 


Qi 


The transfer Junction (also called activation function or threshold function) i 
ally specified as the following Sigmoid function 


is usu- 


f(r) = 


1 + e~ r ' 


Other choices of the transfer function can be the hyperbolic 

1 - e~ r 


tangent function 


f(r) = 


1 + e~ 


the piecewise-linear function 


{ 1. r > 0.5; 

r + 0.5, -0.5 < r < 0.5; 

0, r < -0.5. 

and, sometimes, the ’pure ’ linear function 

}{r) = r. 

These transfer functions are displayed in Fig. 1(b). 

2.2 Radial Basis Function Neural Network 
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Radial Basis Fusion (RBF) NN usually have one iapu , ^ ^ ^ 
and one output layer, as shown in Fig. 2. 

Fo r the RBF network, Fi, 2 , we have the relations ^ ^ ^ 

‘ } a " d ‘ he ° Utput *1 0«« k=l,....„ 3 ) ^ fo „ ows . 


n 2 


** = ! 2 w h r j + bl 


J=l 


n\ 


Tj ~ Yl G ( x \-™),b\ 0 ) 

where w 2 , b 2 are the weights and bias 


1=1 


the transfer function: 


respectively, and the Gaussian function i 


is used as 


G ( x *’ w b Kj) = exp(-[b] d l 2 [ x ] - w)} 2 ) 


where w l is the center 


vector of the input data, and b 1 is the 


variance vector. 


3. Features of ANN 


Some important features of NN are briefed as follows. 

. Many NN methods are universal approximators, in the sense that, given a di 
— (number of hidden layers and neurons of each layer) large enough any ^ 
nuous mappmg can be reaped. Fortunately, the two NNs we are most Interested in 

^ NN a " d "**> -is NN, are examples of such 

approximators (Ref. 4,5). 

. Steps of utilizing NN: specification of the structure (topology)^ training 
(learning) -4 simulation (recalling). 


universal 
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(1) Choosing structural and initial parameters (number of layers, number of neurons of 
each layer, and initial values of weights and thresholds, and the kind of transfer function) 
is usually from experiences of the user and sometimes can be provided by the algorithms. 

(2) The training process uses given input and output data sets to determine the optimal 
combination of weights and thresholds. It is the major and the most time-consuming part 
of NN modeling, and there are lots of methods regarding different types of NN. 

(3) Simulation means using the trained NN to predict output according to new inputs 
(This corresponds to the ’recall 1 function of the brain). 

• The input and output relationship of NN is highly nonlinear. This is mainly 
introduced by the nonlinear transfer function. Some networks, e.g. the so-called ’’abduc- 

tive” networks, use double even triple powers besides linear terms in their layer to layer 
input-output relations (Ref. 6). 

• A NN is parallel in nature, so it can make computation fast. Neural networks 
are ideal for implementation in parallel computers. Though NN is usually simulated in 
ordinary computers in a sequential manner. 

• A NN provides general mechanisms for building models from data, or give a 
general means to set up input-output mapping. The input and output can be continuous 
(numerical), or not continuous (binary, or of patterns). 

• TVaining a NN is an optimization process based on minimizing some kind of 
difference between the observed data and the predicted while varying the weights and 
thresholds. For numerical modeling, which is of our major concern for the present study, 

there is a great similarity between NN training and some kind of least-square fitting or 
interpolation. 
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• Where and when to use NN denend .... ( u 

NN depend 0 n the situation, and NN is not a panacea. 

he following comment on NN application 
generalized in other areas: 


on structural engineering seemingly can be 


Tke real as, fulness of neural networks in structural engineering is not in re P ro- 

“7 “y alsorithmic stnctural as a computa 

ona ye cient alternate, but in proving concise relationships that capture previous 

design and analysis experiences that are useful fnr , • 

/ for making design decisions” (Ref. 9) 

TooI 4 box A1S ° r “ hmS MATlAB N< " ,ral N «™rk 

u put variables. Th,s ,s accomplished by specifying the two matrices „ and ft where p is 
a «• tPatrix; . is the number of input variables, and n the number of sets of training 

7 mat - ' " * “ — variables. Then the number of 

network layers, and numbers of neurons of each layer should be specified. 

algorithms f ° r SPm!ymS mltial values of weights and thresholds in 
at training can be started. For feed-forward NN, function initff is given for this 
purpose. The following is an example of using the algorithm 

HM^M^SHnitff^nl, ■logins, logsig’t, -logsig’); 

’ 2 ’ ^ ^ mitiaI ValU6S f ° r the W6ight matrices of the 1st (hidden), 2nd 

(hidden) and 3rd (output) layer respectively, M are the bia£ (threshol<J) 

^ ^ nUmbW " — * ** - - ~ ««- - respectively, and ^ 
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means that the Sigmoid transfer function is used. 

The present version of Matbab NN Toolbox can support only 2 hidden layers but 

the number of neurons is only limited by the available memory of the computer system 

bemg used. For the transfer function, one can also use other choices, such as tansig- 
yperbohc tangent sigmoid), Wins' ( radia , basis) and ^ 

Experiences of using imtff indicated that it seems tQ be , random ^ ^ 

It is found that the result of the execution of this algorithm each time is different. And 

other conditions kept the same, two execution of this function usually give quite different 

converging h.sto„es of training by the training algorithm (see Fig. 3 (a) and (b)). We 
shall discuss this later in 6.3. 


Shown in the following is the Matbab algorithm for 
with back-propagation: 


training feed-forward network 


[wl, bl, w2, bS, wS, b3, ep, trj—trainbp (wl, bl, ’ logsig\ w 2,b2 , 7 ogsig’, w3 ,b3, ’I 


’logsig’,p t t f tp) ; 


where most of the parameters which the user should take care of have been mentioned 

.o «he above paragraphs. The only set of parameters that the user sometimes need to 

fj IS the 1x4 vector tp , where the first element indicates the number of iterations 

between updating displays, the second the maximum number of iterations of training after 

which the algorithm terminates the training process, the third the converging criterion 

(sum-squared error goal), and the las, being the learning rate. The default value of tp is 
[25, 100, 0.02, 0.01], 


Other algorithms for training: trainbpa (train feed-forward 


and adaptive learning), solverb (design and train radial basis 


NN with back-propagation 


network), and trainlm (train 
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feed-forward NN with Levenberg-Marquardt) 


etc. 


***» and ***, have ve, sitniiar forraats for „ sing „ ^ rf 

[ wl,bl , w2, b2, nr, err] =solverb (p , t. tp ), 
where the algorithm chooses centers for th* r 

of the hdd , “ d increases the "™ber 

Of the hidden layer automatically if the train W „ 

So U is also a designing aigorithm. 

After the NN is trained, one can predict ontpnt from input by usiflg 

7 S " ter ” S ° f ^ 0btainCd Paramet6rS ^r feed-forward 

network one use 


y simuff(x, wl, bl, ’logsig ’, w2, b2, ’logsig ’) 

basisletlT hT matnX> and ’ the PrediCt6d ° UtPUt matriX ' SimiIarly ’ after a radial 

basis network has been trained one uses 


y~simurb(x, wl, bl, w2, b2 ) 


to predict the output. 

Once a NN is trained, we can use the formations in 2., or 2.2 together with the 

obtained parameters (weights etc i tn c D+ 

iweignts etc.j to setup the network to do predict,'™ ^ u 
nnt •, • , Prediction anywhere and 

necessarily within the Matlab environment 
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5. Examples of Application Using MATLAB Neu- 
ral Network Toolbox 


5.1 A Single Input Single Output Function 


The training data set are 


p [0.1, 0.2, 0.3, 0-4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0]; 
t=[0.01, 0.04, 0.09, 0.16, 0.25, 0.36, O.49, 0.36, 0.25, 0.16]; 

A 1-9-1 (a input layer with 1 input, a hidden layer with 9 neurons, and an output 

layer with 1 output) feed-forward NN was trained with train*,. The comparison between 

the deemed (with symbol V) and the predicted (smooth line) values and the training 

history are shown in Figs, 4 (a) and (b) respectively. Another 1-9-6-1 (an input layer 

with 1 input, two hidden layer with 9 and 6 neurons respectively, and a output layer with 

put) feed forward NN was also trained, and the comparison between the desired 

values and the predicted values and the training history are shown in Figs. 5 (a) and (b), 
respectively. 


By adjusting the learning rate according to the circumstances, adaptive learning 
usally can give better results than with a constant learning rate specified before the 
training bepns, as displayed in Fig. 6 (a) and (b). In Fig. 6 (b), the dotted curve is the 
variation of learning rate, and the continuous curve is the variation of error. 


5.2 Training of Multiple Variable Mappings 

A multi-variable mapping is much more complicated than a single input single 
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output function. If we say for the single input single output function in Sect. 5.1, 10 sets 
of data can give a somewhat complete description of the relation between the input and 
the output, for a mapping with k input variables and one output variable, we will need 
10‘ sets of data to obtain the same degree of completeness of description. That means 
the efforts of training a neural network to simulate a multiple variable mapping increases 
exponentially with the number of input variables. 

For many practical cases, one must obtain a multiple variable mapping from a quite 

limited data set. The worse is, the data are usually randomly distributed in the input 

domain, rather than distributed uniformly as in the cases of experiments designed by the 

DOF (degree of freedom) method. It will be of interest to know the behaviors of NNs for 
such kind of data. 

The mapping shown in Fig. 7 is 

2 

2 = 5O + sin(Axy 2 )), D = {0 < x, y < 1}. 

Firstly we used an 11x11 training data uniformly distributed in the domain D to 

train a 2-10-1 feed forward NN, then the NN was used to simulate the mapping on a 

51x51 testing mesh uniformly distributed in the domain D. The simulated mapping and 

the relative errors, compared to the exact mapping shown in Fig. 7, are given in Figs. 

8(a) and 8(b) respectively. It can be noted that around a corner of domain D, i.e. (* =f, 

y =l), the relative errors have larger values since the abstract values of 2 have a minimum 
at the corner. 

Secondly we trained a 2-10-1 NN from a training data randomly distributed in 
the domain D with the number of data sets n b = 11x11=121. The NN was also used to 
simulate the mapping on the 51x51 testing mesh uniformly distributed in the domain D, 
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and the results are shown in Fig. 9, Sparing Fig. 9(b) with Fig . 8(b) ; we can see 

that a NN trained with random* distributed data has an accuracy a, least one order of 
magnitude worse than that with uniformly distributed training data. 

Then more NNs were trained from training data randomly distributed in the do- 

51*51 testing mesh uniformly distributed in the domain D. One example of these results, 
n 24, is shown in Fig. 10. The error (the maximum of the 51x51 relative testing errors) 
w.th randomly distributed training data as a function of „, compared with the results ob- 
tained using uniformly distributed training data, is shown in Fig. From these resuits 
the following conclusions can be drawn: 


(a) With uniformly distributed training data, the 
than that with randomly distributed training data; 


accuracy of the NN is much higher 


(b)For cases with randomly distributed training data, 
data sets decreases, the averaged relative 
and the error deviation increases either. 


when the number of training 
error increases in an inversely proportional way, 


5.3 A Neural Network to Represent Experimental Data 

This is an application of NN to represent experimental data fitting to mode, a 
structure, problem, namely, the behavior of semi-rigid steel structure connections related 
to the physical connection conditions of the adjacent parts (contact and friction), and local 
purification effects etc. While these highly nonlinear effects prevent a practical rational 
method from being available, neura, network modeling from some experimental data base 
seems to be a good solution to the related structural design and analysis problems 
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Only the case of single-angle beam-to-column connections bolted to both beam and 

column (Fig. 12) would be reproduced. There are 6 sets of experimental data available, 

5 were used for training, and the remaining 1 set for testing. The input data are shown 
in Table 1. 

Following Ref. 7, a 3-50-50-22 feed-forward neural network is used. The network 
has 3 input variables, given in Table 1, two hidden layers of 50 neurons each, and 22 
output variables (the first 21 are moments normalized by the largest at different angles 
of rotation, as specified in Fig. 5 of Ref. 7, and the last variable is a value reflecting 
the largest moment of each set of data, an inverse of the value of the largest moment 
multiplying by a constant making sure that result being less than 1). 

Results with different choices of training sets or different initialization schemes are 
shown in Fig. 13. (a), (b), and(c). Both in (a) and (c) Experiment No.l, 2, 3, 5, 6 are 
used as training sets, and No. 4 is used as the testing set. While in (b) Experiment No.l, 
2, 3, 4, 6 are used as training sets, and No. 5 is used as the testing set. The difference 
between (a) and (c) is that in (a) ’logsig’ is chosen as the transfer function in using the 
algorithm initff to provide initial weights and bias for the training process (the same as 
in (b)), while in (c) ’tansig’ is used. In all the three cases ’ logsig ’ is used as the transfer 
function in the training process. The error goal for the training is 0.5E-5, and the number 
of iterations for convergence were 12377, 6991 and 17118 respectively, in the three cases. 

As shown in Fig. 13, the NN with adequate hidden layers and neurons is flexible 
enough to give almost perfect reproduction of the training data. This is anticipated 
if the training process is converged to the specified small error goal. But what is of 
a great interest is the comparison between the actual and the predicted values for the 
testing data sets, the ones which were not used for the training process. We can see for 
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2™ eXamPfe ’ thC NN “S " « — , the predicted result can 

the :~ m (a)) ' UndereStimate 1“ fa 0»], - good approximation of 
the desired value fas in r ru; 

transfer f„„ f " " "* ^ th<i ** that difte ” ‘ combinations of 

tons are used. As will be seen in the following sections (6.3 and 5.5) when 

the transfer function ’tansig', instead of ’loosin' j s used ' u, , 

9 ’ the aI ®orithm initff , together 

buns’ used in the training algorithm trainbp (that is F 1 ,■ 

Formulation / in 6 3) the net t ■ ° *** ° f 

• J, e network is more robust to give smooth an 

5-4 Neural Networks AppHed to More Structure. Problems 

8 Al, r “ S ^ neUra ' “ fa StrUC ‘ Ura! engineering be *— - Ref 
_r : treated " R6f 8 ^ 9 - • conclusion that 

cl^^pro °^ a P rob * em based on dimensional analysis and domain knowledge 

two robl W 6 ^ ° rmanCe ° f thC ^ Sha ” rePr ° d “ Ce the ""*• <* ‘»e first 

ems with an aim to compare the performances of Matlab NN T Ih 

those of the software used in Ref. g a „ d Ref „ an<J ^ . 

wilt , ’ “ crease our experiences of dealing 

with real structural problems. g 


5.4.1 A Pattern Recognition Problem 

The first problem is related to pattern recognition. The neural network wil, be 
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moments at 10 sections along the span, and the location of the load represents the output 
value. As mentioned in Ref. 9, the training data sets in Ref. 8 have some mistakes in 
them. The data sets shown in Table 2 are the corrected ones. 

We use both multi-layer feed-forward network with backward propagation training 
and the radial basis function network to attack the problem. A 10-3-1 feed-forward NN 
is used, and the results are shown in Fig. 14. With only one hidden layer and 3 neurons 
in the hidden layer, we have produced results that are comparable to those in Refs. 8 
and 9 achieved using 2 hidden layers, each with 10 neurons. This seems to be because 

our neuron number has already been of the order of the number of training data sets (the 
latter is 4). 

The radial basis NN performed better for this problem. With only 3 iterations 
(which means that the Matlab algorithm solverb has changed the network structure only 
3 times, each time increasing a neuron in the hidden layer, until 3 sets of the training data 
have been taken as the center vector), it gives results (see Fig. 15) which are comparable 
with those obtained from the feed-forward network after thousands of iterations. This is 
consistent with the finding that radial basis networks behave well for modeling systems 

With low dimension or isolated "bumps” (Ref. 3) ( the pattern of the beam moment 
distribution is bump-shaped). 

5.4.2 An Ultimate Moment Capacity of a Beam 

The second problem is about the design of a simple concrete beam, that is, selecting 

the depth of a singly reinforced rectangular concrete beam to provide the required ultimate 
moment capacity. 
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5 ' 4 ' 21 A " Multi - InpUt S “^ Output Formulation 


In Ref. 8, the problem was formulafpH int« 

output variable A feed fo 8 maPPmS 5 inPUt VariableS and 1 

e- A teed- forward network with twn , 

used. A set of 21 oat, ' ayWS ° f Six neurons each was 

patterns were random,}- chosen as training data And r ,h 

a set of 10 additional patterns A d f h ' testn, 6' 

patterns were used ( see Table 3 in Ref. g) We ,, 

settings for training the network and the same t ■ ■ 83,1,6 

displayed in Fig , 6(a) „ “ d t6Sting data - The «*«lts are 

8 I6(a) “ C ° mpared ^ose of Ref g in Fig 16(b) . 


S-4.2.2 A Single Input Single Output Formulation 


It has been pointed out in Ref o ,h„, f 

mathematical mode, only t d —dimension analysis of the 

• y tW ° — dtmensional variables are required for the 

t,OD ' H “ C6 th6 *> - * Heh 8 were reorganised in Ref 9 a rd a 

inPUt aDd “ ~~ -ro, and one hidden layer having 3 *“ * 

a simpler relationship, and it was found in R e f 9 that „ , ‘ Th,S * 

as the training data, the network can produce ’ US1 " g ° n ' 5 ’ "* ° f Patter " S 

-tag and testing data (Table 3 m J We use the same 

the same (except more ' ° f the “*«** “re almost 

(except more neurons and one more hidden layer are tried, n , 

IS shown in Fig. 17 ( a ) wh - , . ^ ° ne of the results 

g (a) ’ WhlCh 15 com P ara ble to those obtained in Ref o , 

Fig. 17 (b). net. 9 as displayed in 

There has been a significant experience in using Matlab NN T ,k 
this seemingly simple . , g matlab NN Toolbox to deal with 

S y P e maPPmg Probl6 “- - « =ha„ discuss in the following. 


5.5 An Example of NN Modeling to Data with Noises 
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The example problem used in Ref. 11 will be solved here by both feed-forward NN 

and radial basis function (RBF) NN to show their respective abilities of modeling data 
with noises. 

A set of 51 points sampled uniformly in the x-direction but with randomly chosen 
noises in the y-direction can be generated by the Matlab code as follows. 

P=51; 
x=0:. 02:1; 

sigma=0.15; 

y—.5+4 *sin(10*x) + sigma *(. 5-rand( l,p ) ); 
plot(x,y, ’c+ ’) 


true function, y(x) .5+.4sin(10x), which will be generated with a more dense 
set of points and plotted by dashed line, is sampled without noise. 

pt=500; 

xt=linspace( 0, l,pt ); 

yt=. 5+4 *sin( 10*x)+sigma *(. 5-rand(l,p ) ); 
hold on 

plot(xt,yt,’b-’) ' 

The comparison of the training points with noises specified above, feed-forward 
NN simulation with two hidden layers (1-20-20-1), and the true function is shown in Fig. 
18(a). A similar comparison with a RBF NN simulation is shown in Fig. 18(b). We can 
see that in both cases that the nenrai networks give reasonable and smooth simulation of 
the true function even though the training data has noise. 
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Another set of training points is generated with noises in both x- and y- direction. 
This gives an even worse situation for the neural network to deal with. 

P=51; 
x=0:. 02:1; 
sigmal=0. 01; 

sigma2=0.15; 

x=x+sigmal *(.5-rand(l,p ) ); 

y=.5+4 *sin(l 0 *x) +sigma2 *(. 5- rand(l,p ) ); 
plot(x,y, ’c+’) 

Comparison of the results in this case is shown in Fig. 19(a) through (d). where (a) 
is about the results by a (1-20-20-1) Feed-Forward NN, (b) is about a RBF NN simulation, 
(c) is a summation of the results in (a) and (b), and (d) is about the results by a (1-40-1) 
FF NN trained with adaptive learning. Very good results are also obtained by all neural 
network modeling. As in the first case, the RBF neural network gives the best results. 

Shown in Fig. 20 is the result from the feed-forward NN with transfer function 
combination Formulation , (see 6.3 in the following). In this case, it is difficult for 
the training to converge, and back-propagation with adaptive learning (that means the 
learning rate is adjusted in each epoch) has to be adopted. Moreover, the simulated result 

curve is quite rough for both 1 and 2 hidden layer cases. We shall discuss this further in 

6 . 3 . 


6. On Some Issues of Neural Network Modeling 


6.1 About the Accuracy and Error Goal 
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a "" NN ra ° delins iS to a curve or interpolation. It 

mP ° SSlble in thC PreS6nCe ° f a Hmited ^formation set (the training 

n r can “ the perf — — .— - — inPUt 

6 ° ’ ‘ h thC train “ K and the “»“< <*> se ts include all sorts of noises. 

Ge " era |, ly ' “ " eXPeCt6d that the “ y - » NN cou,d be better in the case of 
interpolation than the case of "extrapolation” (for instance, see Figs. 14 and 15). 

The error goal for the training of a NN' has „ Hi™, • 

„ . „ * n.n has a direct impact on the training time and 

ome influence on the prediction accurac, ,t seems that since usual, the data (whether 
or training or for testing) is polluted by noise, a very small error goal would not be a good 

sometimes may mean worse accuracy for the testing data outside the training set. 

These have been clearly shown in Fig. 21. fn Fig. 21(a) „e can see that as a result 

01 very small error go,, the accuracy of testing is very poor despite a perfect training 
uracy, and in Fig. 21(b) and (c) through the ^ of . moderate ^ both 

the testing and training accuracy are of the same order For the • c- 
. . . e • For the case m Fig. 21(a) the 

training time was about 5 times longer. 


6.2 Normalization of the Training Data 

When the Sigmoid or other transfer functions which squash the output in the range 

d ’ PreParatl °" ° f data * S ^ lmportant before modeling by NN and training 
For feed-forward neural network, the input and output data must be normal, that is 
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*c.ose and .ess than 1, say 0,8. If the original vaIues of the input and ^ ^ ^ 
an 1, it should also be normalized and scaled to a value close to 1. 

We can see this effect clearly in Fie 29 (»\ a t 

moment • S (a) “ d (b) In F «- 22 (a) the ultimate 

moment capacity problem 4 9 o\ • 

blem (5. 4.2, 2) is again solved by multiplying input , by 4 and 

I" 4 ‘ ^ 3 S ° th3t b ° th the larg6St " h - are - > Compared with Fig. 17(a) 
w ere t e number of tterations (epoch) is in thousands) the performance of the NN has 

ZZTT" lhe ^ ^ 016 ^-.smtofa 

gle input single output problem using the formulation 

M, »1, m2, 62, ep, tr ] ] = trainlm( w l, M/ logsig', w2 , 62/ logsig', p, *, tp) . 


The situation can be overcome when the -pure’ linear transf r • 

This is illustrated in Fig 22(c) wh th nCt '°" " USe “' 

‘ 6 the reSUl * f ° r the “« problem in Fig. 22(b) using 


the formulation 


[™l, bl, w2, 62, ep , tr) = trainlm(wl, 61/ logsig', w 2, 62/ 


purelin' ,p, t,tp ); 


is shown. 


For the radial basis NN, although there i 


the input to and output from such 


is no such restriction, still it is better to scale 


a network around (0,1) for an improved performance. 

6.3 Initialization and Specification of Transfer Function 

There are some tricks in using the initialisation algorithms. Firstly, as already 


mentioned, the initialization algorithm initff 


is a random process, so it is possible that 
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training histories of a given problem may be quite different, with one showing a quick 
convergence and the other needing thousands of extra iterations. 

Secondly, the specification of the transfer function involved in initff and trambp 
has very large influences on the training history and the performance of the network. It 

has been found that there are only two combinations of initff and trainbp that are feasible 
in practice, as shown below: 

Formulation I: 


[wl, bl, w2, b2, u>3, b3j— initff (p, nl, ’ logsig’, n2 , ’logsig’, t, ’logsig’); 
fwl,bl,w2, bS, w3,b3, ep, trj=trainbp(wl,bl, ’logsig w2, b2, ’logsig w3,b3, ’logsig’, p,t,tp); 


Formulation II: 


(wi,bl, w2, bS, w3, b8]=initff(p,nl, ’tansy’, nS, 'tansig’,t, ’tansig’); 
l<°lM,wS,bS,wS,b3,ep,trl=tmmbp(u,l,bl, I W,w*,42 , 1 cgsu,’, *3,bS, logsig-, p,t,tp); 


Formulation 1 usually makes the training process oscillate (the error-epoch line 
quite rough) but approach the error goal more directly while the error-epoch line of 
Formulation II is usually quite smooth but at the later part of training the error decreases 
very slowly. See Figs. 4(b), 5(b) ( Formulation II), and Fig. 3(a), 3(b) ( Formulation I) 

As to the performance of a NN, although Formulation I can predict the output in 
the training data set quite well, it sometimes gives very poor testing results outside the 
training data. See Fig. 23 and Fig. 13 (a) and (b). Further, from Fig. 18 to 20, we can 
see while Formulation II gives quite robust and smooth performance Formulation I, in 
contrast, gives unstable and rough predictions when the training data includes noise. 
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So we prefer Formulation II. 


6.4 NN Structural Problem 

The number of the hidden layers and the number of neurons in each layer determine 

the freedom or dimension of a NN. Generally speaking, more hidden layers and more 

neurons the layers mean a larger flexibility of the NN. and accordingly smaller error 
goal can be specified. 

relation between the training time and the numbers of hidden layers and 
neurons is a complicated one. A more complex NN (with more hidden layers and neurons) 
certainly needs more execution time in each training iteration. But a network with more 
layers or neurons may need less iterations for training to read, the desired error goal. 

From Fig. 24 (a) through (h), we can say the adequate degree of freedom of a NN 
(total number of neurons) is about the same order of the number of the training data sets 

More hidden layers and neurons no, only increase the training effort, but sometimes also 
give poor performance for the network. 

7. Cross-Validation Algorithms and Some Obser- 
vations 


Cross-validation is a standard tool in statistics (Ref. 1 ). This techni<)ue fee 
used to optimize neural network structural parameter i.e. numbers of hidden layers and 
neurons of each hidden layer etc. It can also be utilized to determine the proper training 
epoch in order to avoid over-training, the case that usually occurs when the number of 
training iteration is too high, with the symptom that errors for the training data approach 
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zero while those for the testing data increase excessively. 

When a network is being trained using the cross-validation technique, the training 
data is divided into two subsets, one subset being used for training of the network, the 

other being used for validation (i.e. testing). The validation subset is typically 10 to 20 
percent of the training set. 

For the case of single output, the error can be defined as the following combined 

error 


Ejfjvrn — 




1/2 


where n r and n s are numbers of data sets in the training and testing subset respectively, t T 

and t s are the target value for the training and testing subset, and s r and s s are the values 

predicted by the network for the training and testing subset. Other two error measures 
are: 



are also of interest and so is the error ratio E ts /E tr . 


Matlab algorithms cvtram , cvtrainJ are available to train Feed-forward NN using 
cross-validation. They are invoked in the following format 
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[wl 9 bl, w2, b2, w3 , b3j — cvtrain( nl,n2, ic , dn y mn,p , t ); 


or 


[wl,bl, w2, b2, w3, b3]=cvtraml (nl y n2,ic y dn, mn,pr , fr, ps, 

where the training is accomplished by invoking trainbp or trambpa, and descriptions of 
the parameters can be found in their help message (see Appendix;. The only difference 
between cvtrain and cvtmml is that in the former the testing subset is determined in the 
alogorithm in a random manner and takes about 20 percent of the training set, while in 
the latter they are provided beforehand as pr, tr , ps, ts. 

Several calculations have been carried out for two problems. The first problem is 
the example problem used in 5 . 5 , now with 21 points as training subset and 20 points as 
testing subset, and the noise level still being at sigma = 0.15. The second is that in the 
ultimate moment capacity problem ( 5 . 4 . 2 . 1 ), with the 21 sets of original training data 
as training subset, and the 10 sets of testing data as testing subset. Feed-forward neural 
networks with different configurations are used and some of the results are displayed in 
Figs. 25 through 30. It should be noted that for the first problem since there exists a 
real function, we can have two kinds of errors, one based on the noise polluted training 
set, another based on points on the real function curve. Since what' we want to model is 
the exact mapping in terms of the real function, and in practical problems this mapping 

is usually nowhere to be found, the error based on real function should be paid more 
attention. 

Hereafter we look through the results, and some conclusions can be drawn, which 
are true at least for the specific problems. 
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• There indeed exists an optimum epoch or ra „ ge rf epoch ^ ^ 
combined error tends to be minimum, as ca „ be seen clearJy 

25(a), 26(a), 27(a) and 30(a), and partly in Figs. 28(a) and 29(a). The 
optimum epoch for the error based on the real function is Jess than 
that for the error based on the training data, the former being about 

half of the latter. The optimum epoch number decreases as the 
network complexity increases. 

• As number of the training epoch increases, the error on the training 
subset, E tr , tends to decrease indefinitely and approaches aero p r( , 

26(c). 27(c), 29(c), and 30(b). This results in the ratio E„/E tr based 
on training data to increase drasticaily, as in Fig. 26(h), 27(b), and 
(b). Bu, the ratio E„/E tr based on the real function is always in 
the vicinty of 1,0 and very stable (see Fig. 25(b) through 29(b)) The 

'7 " e “ Si " Ce b ° th * here mpresen, the average 

i erence between the network generalization and the real function. 

. As to the probiem of network comp, ex, ty, at ,east for the choices having 
been made, ,t seems that the best performance is given by a network 
mg about the same number of neurons as that of the training data 
-ts, a fact mentioned earlier. This observation is based on the assume 

“° n tha ‘ ‘ he rCaI maPP ‘ ng " - * unaffected by the noise in 

the training and testing data. 


• Cross-validation technique can be 


in NN training. But it is time- 


used to investigate a lot of problems 


consuming and not mature enough to 
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be incorporated into a routine training process. A tentative standard 
for stop to avoid over-training can be E ts /E tr = 2 and Epoch>mn (a 
minimum training number). But this strategy is not so successful (for 
instance, it is not clear how to determine tyitl). 

• Over-training is not so serious a problem as it seems to be. At least 
beyond the optimum epoch the combined error only increases slowly. 

As to the error ratio of testing over training, it is almost a constant (1.0) 
if it is based on the real mapping and if the training and testing data 
are uniformly noise-polluted. This means that training epoch should 

be high enough. Under-training is worse than over-training and should 
be avoided. 

• From the figures it can be seen that for the example problem the train- 
ing of network is locally unstable with one hidden layer while not with 
two hidden layers. From Fig. 30(a) and some other results not included 
here, we can see the situation for the ultimate moment capacity prob- 
lem in 5.4.2. 1 is the same. Obviously, in some way networks with two 
hidden layers are superior to those with only one (Ref. 1). 

• If a testing data is incorporated into a cross-validation process to train 
a network, it contributes (at least indirectly) to the training and loses 
its status as independent testing data. 

8. Methods of Obtaining Continuum Models 

A lot of methods have been used to develop continuum models to represent com 
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plex structures. Many of these methods invoive the determination of the appropriate 
relationships between the geometric and m t • 1 

it , t . -c and material properties of the original structure and 

con inuum models. An important observation is that the continuum model is nol 

Zedt " COntinU ™ “ 0del f ° r 8 giVe " ^ inherent,. 

e ere ore diverse approaches can be used. This can be dearly shown in the 
“ ° f continuum modeis for a iattice structure. 

The si„ g ,e b ay doubie-iaced iattice structure shown in Pig. 3, h, been studied 
properties is properly provided. 

tions rr t method inc,ude the fo,iwing steps (Rcr i2,: (i)imrod “ c ^ — . p- 

egarding the variation of the dispiacements and temperature in the piane of the 
cross section for the beamlike iattice, (.expressing the strains in the individua, dements 

: -ofthestrai„compo„e„tsi„the,sumed coordinate directions, (3) e xp a„di„ g el 

J ram components in a Taylor series, and (.summing up the thermoeiastic strain 

l I r S Wh " the " ^ ““ - Sarnie coef- 

nts the beam mode, in terms of materia, properties and geometry of the origina, 
lattice structure. 8 


In Sun et al (Ref.13), the properties of the continuum model is obtained respectively 

bounta r mati0 ” ° f " rePeati " g 06,1 difee ”* - — — « 

num ' I 7 lt,<>nS FOT eXamP ‘ e ' the Sh6ar ^ " ° btained by Perfomin « a 

“ 7 in a - - - * end Of the repeating cei, 

corresponding shear deformation is emulated by using a finite dement 

e mass and rotatory inertia are calculated with a averaging procedure. 


program. 
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Lee put forward a method that he thought to be more straightforward (Ref. 14). 
He used an extended Timishenko beam to model the equivalent continuum beam. By 
expressing the total strain and kinetic energy of the repeating cell in terms of the dis- 
placement vector at both ends of the continuum model, and equating them to those 
obtained through the extended Timishenko beam theory, he got a group of relations. The 
number of these relations, 2N(1+2N), where N is the degree of freedom of the contnuum 
model, is usually larger than that of the equivalent continuum beam properties to be 
determined. Lee then introduced a procedure in which the stiffness and mass matrices for 
both the lattice cell and the continuum model are reduced and so is the number of rela- 

tmns. Yet how to reduce the number of relations to be equal to the number of unkowns 
seems depend on luck. 

All the above three methods give close results for the continuum model properties, 
and the continuum models also generate promising results for the lattice structure. 

9. An Example of NN Modeling of Continuum 
Models 

Emphasizing the application of NN, we choose an approach similar to that in Ref. 
13, that is, to derive the properties of the beam by investigating the force-deformation 
relationships of the repeating cell in certain boundary conditions. The approach is illus- 
trated in Figs. 32 (a), (b) and (c), where the beam’s axial rigidity EA, bending rigidity 
EL. and shearing ridigity GA are calculated respectively by using the results of finite 
element analysis of the repeating cell in different load conditions. Concerning the finite 
element analysis of 3-D lattice structures one can consult Ref. 15. 

There are five parameters of the repeating cell for the lattice structure in Fig. 
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31 that can be varied, the longitudinal bar length £„, the batten length L„ and the 

longitudinal, batten and diagonal bar area, A c , A , and Aj. Generally, a function with 

more variables will be more complex and it will be more difficult for a neural network 

to simulate its performance. A NN with more input variables needs much more training 

data since in the training data each variable should vary separately As can be shown in 

the following, this kind of “coarse” training data pose an obstacle to most of the training 
algorithms. 

Three scenarios were investigated, with the number of input variables set to be 2, 
3 and 4 respectively. 

9.1 Neural Network with 2 Input Variables 

The input variables are L c and .4,. The number of training data sets is 400=20x20. 
The number of testing data, most of which are located at centers among the training data 
mesh, is also 400-20x20. Part of the results, about GA, is shown in Fig. 33. Simulations 
on the testing data and the relative errors of a FF NN (2-10-1) trained with Levenberg- 

Marquardt (trainlm) are shown in Figs. 33 (b) and (c). Results of a RBF NN doing the 
same job are shown in Figs. 33 (d) and (e). 

We could not say that FF is superior over RBF just because its testing accuracy 
shown in Fig. 33 (c) is higher than RBF’s shown in Fig. 33 (e). We can adjust the 
training criterion to change a RBF NN’s behavior and it should be noted that what is 
shown in Figs. 33 (d) and (e) are not RBF NN’s best performances. 

9.2 Neural Network with 3 Input Variables 


31 



The input variables were chosen as L e , A c and A d . The number of training data 
sets is 343=7x7x7. For this case, the effectiveness of different training algorithms can be 
seen clearly in Fig. 34. When ordinary back-propagation training algorithm, i.e. trainbp 
is used, it is very hard to train the NN to the error level of 10" 1 , as shown in Fig. 34 (a). 
When the adaptive learning technique is included, an improvement can be made, but it is 
still hard to reach the 10" 2 error level, as can be seen in Fig. 34 (b). Now if the algorithm 

with Levenberg-Marquardt ( trainlm ) is used, it is quite easy to push the training error 
level to the order of 10~ 4 . 

The improvement by trainlm is really amazing. All the training algorithms carry 
out an optimization process. While trainbp uses steepest-descent method with constant 
step size, trainbpa accelerates the process by adjusting the step size. On the other hand, 
trainlm adopts a kind of modified Newton’s Methods, which adjusts both the searching 

direction and the step size. Concerning the optimization methods one can consult Ref. 
16. 


Samples of the NN simulation results are given in Table 3, where the desired values 
and values obtained by Noor et al (Ref. 12) and Lee (Ref. 14) are also presented. 

9.3 Neural Network with 4 Input Variables 

The input variables were chosen as L c , A c , A g and A d . The number of training 
data sets is 625=5x5x5x5. For this case only trainlm could train a FF NN that could 
give resonable results. Samples of the NN simulation results are given in Table 4. 
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Appendix Help messages for Matlab 


Algorithms cvtrain and cvtrainl 


> help cvtrain 


[wl,bl,w2,b2,w3,b3]-cvtrain(nl,n2,ic,dn,mn,p, t ) 

Inp M,IDATI0K TRAINING 0F fEED-FORWARD NETWORK BY BACK-PROPAGATION 
Input Parameters : 

nl : Neuron n *£% % £i ~t hidden layer. 

- 1= without Haptiie iSnn^f 6 " ;? yer lif "» input 0 ). 

SS* r ° f tMi nin S & » dapti - learning . 

Minimum training number 

lhln mat J iX ° f input sectors. 

Nu^L ™ trix ° f target vectors. 

Number of input variables 

Number of training data sets 

Return S^tels^ 
w f network weights, 
bi - network biases . 


dn 

mn 

P 

t 

m 

n 

1 


nelp cvtrainl 

[ wl , bl , w2 , b2 , w3 , b3 ] =cvtrai ni ini o • 

or (with pre-training) 1 - n2 » ic, dn, mn,pr, tr, ps , ts ) 

[wl , bl , w2 , b2 , w3 , b3 ] =cvtrainl (nl n 2 ic dn 

- 2 , ic , dn,mn,pr, tr,p S , ts , wl , bl w 2 b 2 >o ■ 
~ ° F “ - back-propagation ' ' 


Input 
nl - 
n2 - 
ic - 
dn - 
mn - 
pr - 
tr - 
ps - 
ts - 
m 

nr - 
ns - 
1 - 
wi - 
bi - 
Return 
wi - 
bi - 


Parameters : 

Neuron nrnSber of the flrst h ^ dden layer. 

1: without adaptive learnincr 1 J den . 1 ®y er (if no input 0) 

1 by nr mltr £ of ' 

T S ns S matrix o°f f *?' 

Number of input variables f ° r testing ' 

Number of training data sets. 

Number of testing data sets. 

Number of output variables. 

network bii?ir(£or c aIJrwit^p?nSai^ in f * 
Parameters: pre training), 

network weights . 
network biases . 
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Experiment ~lVumhpr nf r^u — a — ; 

Number M Angle Tmckness I Angle Lenrth l 

p D T (inch) 

i — I — ! — P— — I 

£ 3 I S -| I ITS 

5 ~ 5 4 ~EL _ [~ 14.S 

I 6 — f- I—- — 

i m — 


TSW * 2 **“"• D »“ '« * Start. Beam e„ b ,.„ 


r* 
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Table 3 Comparison of Continuum Model Properties for 

Lattice Repeating Cell 
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Fig. 3(a) Thp 

histories of twn Uence of initial weights anri u- 


3500 





Sum-Squared Error 


comparison 





^£•5 (a) 1-9 6-1 feed-forward 
comparison between simulation 


“"*** ^-propagation: 

(solid hne) and desired value (V) 



R g-5 (b) 1-9-6 


1 feed -fo™ard NN trained with back 
training history 


propagation: 


Sum Squared Netwo rk Error for 1000 Epochs 



Fig.6 (a) 1 -9-6-1 feed-forward NN trained with BP and adaptive learning- 
comparison between simulation (solid line) and desiredTalue ( + ) S ' 
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Sum- 
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ig. 8(a) Simulated mapping by a 2-10-1 NN trained by 11x11 uniformly 

distributed data 



Fig, 8(b) The relative error compared to the exact mapping (Fig. 7) 
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F.g. 9(a) Simulated mapping by a 2-10-1 NN trained by 121 se ts 
randomly distributed data 



l ) be relative error compared 





1 



1 0 
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Fig. 


10(a) Simulated mapping by a 2-10-1 N N 

randomly distributed data 


trained by 24 sets of 



i o 


Fig. 


10(b) The relative 


error compared to the 


exact 


mapping (Fig. 7 ) 
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Relative Error 


as a function of n/nO 


o 


8 

© 


o: 


Relative Error with randomly distributed training data 
Averaged Relative Error with randomly distributed training data 
Averaged Relative Error with uniformly distributed training data 



Fig. 11 The relative error as a function of n = - 2 . 
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Fig. 21(b) Solving the problem in 5.4.2. 1 with different 
schemes and results: 5-10-10-1 NX. quite large error goal 

epoch=2000 


o: training + : testing 


© 8 , 


+/ O 


Oo + 
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problem m S.4.2.2. 1-9-1, 4* P . 3*t. epoch=468 


o: desired -k predicted *: training data 




u 


0.2 


Rg ' 22 (b) Normalization of 

‘raining data t(.7) = ,. 22 > , ° UtpUt '***-, 

' " ° >S: ^ «ne: simuia 



F ' S ' 22 W Normalization of i nD „t a 

PM “ d °“ tput -iabies: We(tV 



Fig ' 23 So, '™g the p rob]em s 

N -V error goal=l 0 4 ' , by Form ^a l ug_ 

6 IU • e Poch=885l 
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lg ' 24(e) S0,Ving again the problem in 5 4 2 2- 1 

error goa]=l 0 -e. e poch=427 ' ' ^ NN 



M -M-. ,££4n “ 





Combined Error 


Fig. 25(a) Training History for the Toy Problem by a (1-20-1) NN: 

Combined Errors 


0 . 044 , 


Error based on: _ Training data — Real function 


0.042 


0.038 


0.036 


0.034 


1 1 ' ^ ftijjf'jjiW 




Pig- 25(b) Training History for the To d 

R;,, "’ S or T «tin* Error overTn^ £ " ™ 

CJ i tuning Error 
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ig_ 25 (c) Results fo r t h e Tn 




^(h^'p < ^^ eSU * tS *° r [ ^ e T °y Problem by a (1-20-1) NN- rv, 
the Prediction, Trainine and Tpcri nn c\. ' ° J NN - Companson 

" g and Testln S Subsets, and the Real Function 


^Training Subset +; Testing Subset _• Prediction - 


•' Real Function 


Combined Error 


Fl g- 26(a) Training History for the Tov Pmhi u 

Cabined E^ Ob,embya(1 - 50 ' , > NN -- 



90 






Fig. 26(b) Training History for the 
Ratios of Testing Error 


Toy Problem by a (1-50-1) NN: 
over Training Error 


Error based on: _ Training data — Real function 







Fig^ 26(c) Results for the Toy Problem by a (1-50-1) NN: 
Comparison of the Prediction and the Target Values 


o: Training Subset +: Testing Subset Perfection 



0.4 0.5 0.6 

Desired Value 



F 'g- 26(d) Results for the Toy Problem by a ( 1 -50- 1 ) NN- To 

•he Prediction, Training and Testing Subsets = ° mpar,SOn 0f 
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mbin^u Error 


Flg ' 27(a) Training History for the Tov P k, 

Combined Errors° em ^ a (*‘100-1) NN: 





Predicted V a / Ue 
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‘g. 27(d) Res ^ s f^fe T P roblenl ([ )00 NN: Co n 
tne Prediction, Training and Testing Subsets 


Prediction Rea, Function 



Fig. 28(a) Training History for the Toy Problem 

Combined Errors 


a (1-20-20-1) NN: 


Error based on: 


— Trainin § data - Real function 



F,s 2m > „ 


£rro; ' based on; 


— Training data 


^ caJ function 
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ig- 28(d) Results for the Tov p kt 

thC PrediCti0n ' Sfsubsets ' C ° mPariSOn ° f 




29(a) Traini " g H ‘ St0ry ' for . ^ Problem by a (1-50-50-1) NN- 

Combined Errors 


^rror based on: __ Training data — R ea | 


function 



Fig. 29(b) Training History for the Toy Probletr 

Kanos of Testing Error over Tn, ini 


by a (1-50-50-1) NN: 
Error 


Error based on: 


— Trainin 9 data — R ea | function 




Fig. 29(c) Results for the Toy Problem by 
Comparison of the Prediction and the 


a (1-50-50-1) NN: 
Target Values 


^Training Subset +: Testing Subset Perfection 

“ ' “I 1- 
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% 29(d) Results for , he T p 

* e ^ a H -50-50- J) m Cn . 

■ l rai ning and Testing Su ^ C °mpanson of 



*r-n 



Fig. 30(b) Results for fh n 

Comparison 0 f th e Predictiw a^d^the^Target Value's 
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longitudinal bar 


diagonal bar 


eometry of repeating cells of a 
single-bay double-laced lattice 
structure 


Length of longitudinal bars: L c 
Length of battens: L g 
Length of diagonal bars: L d = [p c + £2j 1/2 

Ar eas: A c , A ff , A d . 



Evaluate Continuum Model Properties 

Axial Rigidity EA: 

EA = L c 

^ V 4 v * + ~v 5 -f l Ve 


Fi S- 32 (a) 
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y, v 



Evaluate Continuum Model Properties 


Bending Rigidity El: 

M = 1 X ~Lg = El 

" c 

Vi; - /• 3 r 2 r 

=> El = J Ltstl ^ 

0 ^ 5-^4 


Fig. 32 fb) 
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y, v 



Evaluate Contin 


UUm Model Properties 



Fi g- 32 ( c ) 
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3.5 



Lc(m) 12 0 Ac(m* 2 ) 


Fi S ■ 33 GA = f( A r v , . 

J[Ac,L c ): (a) Ti-aini 


J ng data 
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Fig. 33 GA f(A c , L c ): (b) FF simulation on testing data 



Fig. 33 GA = /(A c , L /) : (c) FF simulation 


errors 


113 


'*■» r ' * 




12 


Fig- 33 CA == /Me, L c ): (d) RBF simulation on testing 



F ig- 33 GA = 


/Me, L c ): (e) RBP simulation 





Fig. 34 (b) Training history of a 3-10-1 FF NN by trainbpa 


116 





